Combining Timbric and Rhythmic Features for Semantic Music Tagging
نویسندگان
چکیده
The recent explosion of online streaming services is a reflection of the high importance the multimedia has in our everyday life. These systems deal with large collections of media pieces and their usability is tightly related to the meta-data associated with content. As long as the meta-data is correctly assigned users can easily reach what they were looking for. The presence of poorly annotated data and continuous addition of new items can lead to poor recommendation performances and loss of earning because a large part of the collection will remain unexplored . Automatic tagging can be a step toward the solution for this problem, however state of the art system are not yet ready to substitute human annotators, expecially for music. In this thesis we propose a novel approach to semantic music tagging. The project takes inspiration from Hidden Markov Models and exploits a statistical framework to semantically link two acoustic features. We make the assumption that acoustically similar songs have similar tags. We model our collection of known songs as a graph where the states represent the songs and the transition probabilities are related to the timbric similarity between songs. Observations and observation probabilities was modeled by rhythm descriptors and rhythm similarity. To query the model we simulate the insertion of the query song in the graph by calculating timbric and rhythmic similarity with the collection. A modified Viterbi algorithm was developed to extract semantically meaningful paths in this songs graph, from the query song. We infer the tags for the query by a weighted sum of tags from the songs in the extracted paths. We tested our model using the CAL500 dataset, a well known dataset for music information retrieval evaluation, and results are promising. Performance measures are good for an first implementation of a new model. This thesis is intended to show the results of our work between august, 2012 and march, 2013 on this project.
منابع مشابه
The rhythm transform: towards a Generic rhythm Description
In the past few years, automatic genre classification has become one of the most interesting topics in the Music Information Retrieval (MIR) field. Musical genre is one of the most valuable metadata when managing huge music databases and many successful efforts have been done to automatically compute it. Basically, timbric and rhythmic description of audio is used for this purpose. From our poi...
متن کاملHomayoun as a Persian Music Scale on Non-Musician’s Brain: an fMRI Study
Introduction: The aim of this study was to get to a neurological evaluation of one of the Persian music scales, Homayoun, on brain activation of non-musician subjects. We selected this scale because Homayoun is one of the main scales in Persian classical music which is similar to minor mode in western scales. Methods: This study was performed on 19 right handed subjects, Aging 22-31. Here some ...
متن کاملAutomatic Music Tagging With Time Series Models
We present a system for automatic music annotation that leverages temporal (e.g., rhythmical) aspects as well as timbral content. Our system estimates a dynamic texture mixture (DTM) density over times series of acoustic features (instead of on individual features) for each tag in a semantic vocabulary. When analyzing a new song, our system processes the time series of acoustic features of the ...
متن کاملA comparsion the two methods effect on divagation of mind: Music Hay–Ho Rhythmic breathing tchninqe
Background: Intravenous cannulation is one of the most common painful procedures that is being used as a routine part of primary care or during diagnostic workups. This procedure is an invasion to physical and psychological space of children and it’s pain must be managed. This clinical trial has been conducted to determine and compare the effects of two music and Hay – Ho Rhythmic Breathing tec...
متن کاملTowards Characterisation of Music via Rhythmic Patterns
A central problem in music information retrieval is finding suitable representations which enable efficient and accurate computation of musical similarity and identity. Low level audio features are ideal for calculating identity, but are of limited use for similarity measures, as many aspects of music can only be captured by considering high level features. We present a new method of characteri...
متن کامل